Computer Vision

8 articles tagged with "Computer Vision"

Part 1: From GANs to Generative AI — Why and How the Migration Happened Series

Part 1: From GANs to Generative AI — Why and How the Migration Happened

Google proved that generative AI can do virtual try-on. Could the same approach replace our entire GAN pipeline? This is the story of migrating from PASTA-GAN++ to Gemini and Vertex AI — simplifying a multi-stage GPU pipeline into a single API call.

Read →
Part 2: Nano Banana Virtual Try-On — 16 Test Cases and What They Revealed Series

Part 2: Nano Banana Virtual Try-On — 16 Test Cases and What They Revealed

Systematic testing of Gemini's image generation for virtual try-on across three phases: noisy inputs, clean images, and high-resolution action poses. The key finding: resolution matters more than any preprocessing pipeline.

Read →
Part 3: The 3-Engine Showdown — PASTA-GAN++ vs Nano Banana vs Vertex AI VTO Series

Part 3: The 3-Engine Showdown — PASTA-GAN++ vs Nano Banana vs Vertex AI VTO

A head-to-head comparison of three generations of virtual try-on technology across 12 test cases. The results reveal not just incremental improvement, but a generational leap — especially in body diversity, where GANs fundamentally failed.

Read →
Part 1: From Photo Booths to Virtual Try-On — The 20-Year Quest Series

Part 1: From Photo Booths to Virtual Try-On — The 20-Year Quest

The origin story of META FIT: how a decades-old vision of seeing yourself in clothes before buying evolved from hardware kiosks to GAN-powered virtual try-on, plus a comprehensive survey of 15+ VTON research models.

Read →
Part 3: Inside PF-AFN — The Try-On Engine in Code Series

Part 3: Inside PF-AFN — The Try-On Engine in Code

A code-level walkthrough of the Parser-Free Appearance Flow Network: Feature Pyramid encoding, CUDA-accelerated correlation kernels, optical flow warping, and the ResUnet generator that composites garments onto people.

Read →
Part 2: Understanding GANs — The Engine Behind Virtual Try-On Series

Part 2: Understanding GANs — The Engine Behind Virtual Try-On

A deep dive into Generative Adversarial Networks: how the generator-discriminator dynamic works, why GANs dominated image generation before diffusion models, and how they power virtual try-on systems.

Read →
Part 4: Pose Estimation, Body Measurement, and 3D Reconstruction Series

Part 4: Pose Estimation, Body Measurement, and 3D Reconstruction

How OpenPose skeletal detection, Graphonomy human parsing, and custom body measurement algorithms work together to enable accurate virtual fitting — plus an exploration of PiFu for 2D-to-3D reconstruction.

Read →
Part 5: Results, Failure Modes, and the Path to Modern Image Generation Series

Part 5: Results, Failure Modes, and the Path to Modern Image Generation

What the GAN-based virtual try-on system achieved, where it failed (and why), the smartphone app design, and how diffusion models are changing everything for the next generation of META FIT.

Read →